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FIELD OF THE INVENTION 
5 The present invention relates to digital processors, and more particularly to a 

method and apparatus for servicing hardware interrupts with greater frequency and 
efficiency. 

BACKGROUND OF THE INVENTION 
10 Interrupts are increasingly irregular inputs to a digital processor that cause 

high-overhead context switches whereby the processor halts its current program 
execution, saves its current context or status, services the interrupt, restores the 
interrupted context or status and resumes execution of the interrupted program. 
Examples of interrupts are input/output events that are not within the control of the 
15 digital processor to predict or to handle as anything other than an exception. Such 

exception handling is referred to in the computer world as a context switch. A context 
switch carries with it an especially high overhead in modem processors, which are 
equipped with instruction pre-fetch queues and look-ahead logic and instruction 
pipelines and caches that permit high speed computation. Intermediate results of 
20 processor operations must be discarded, e.g. by flushing the instruction pre-fetch 
cache, and much time is wasted in traditional response to an interrupt. Because 
interrupt processing is controlled by the digital processor hardware, its priority in time 
is secondary to the regular, streamlined code and data processing performed by the 
processor. 

25 The trend is toward increased speed and functionality in computers such as 

personal computers (PCs), workstations, network servers and the like. But part of the 
increased speed and functionality is an increase in input/output (I/O) bandwidth and 
variety. Inputs to computers include keyboards, mouses, joysticks, video cards, 
musical instrument digital interfaces (MIDIs), compact disc read-only memory 

30 devices (CD-ROMs), small computer systems interface (SCSIs), disc drives, zip 
drives, fax-modems, e-mail and Internet downloads. All such inputs generate 
hardware interrupts to the processor. All such interrupts to a conventional processor 
requires a context switch in which the processor halts its current instruction flow. 
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flushes the intermediate results of instruction pre-fetched macro-code execution, saves 
its address counters and status registers, and performs a branch to unrelated interrupt 
service routine code. After each hardware interrupt, the processor restores its status 
registers and address counters and resumes what it was doing. Such hardware 
5 interrupts are many — a busy mouse may generate hundreds of interrupts per second. 
Disc drives and video cards may generate thousands of interrupts per second, each. A 
single Internet application, while running on your personal computer, may generate 
tens of thousands of interrupts per second. The context switching overhead adversely 
impacts the performance of conventional processors each and every time one of these 

10 hardware interrupts occurs. 

A video card generates a relatively low volume of interrupts at fairly regular 
intervals, e.g. when the video cache expires. A mouse or a joystick generates a high 
volume of interrupts at fairly regular intervals, at least while the mouse or joystick is 
in use. A disc drive generates a relatively high volume of interrupts at irregular 

15 intervals. A modem that is being used in a network application similarly generates a 
relatively high volume of interrupts, also at irregular intervals. Because many 
network applications such as downloading files or page swapping are themselves disc- 
intensive, the pace of hardware interrupts from disc drives and modems in network 
applications are the most difficult to predict. Accordingly, network applications 

20 represent a significant challenge in the design of computers that are flexible and 
responsive to hardware interrupt servicing. 



SUMMARY 

The method includes detecting and prioritizing one or more interrupt service 
25 requests; inserting interrupt servicing instructions responsive to the interrupt service 
request into an instruction queue mechanism; and processing the instructions within 
the instruction queue mechanism including the inserted interrupt servicing 
instructions. The instruction queue mechanism may include an instruction cache and 
an instruction fetch unit for fetching instructions from the instruction cache, wherein 
30 the processing includes decoding the instructions into micro-opcodes and executing 
the micro-opcodes in one or more out-of-order execution units. The method further 
includes retiring the executed micro-opcodes including those micro-opcodes 
representing the inserted interrupt servicing instructions to the instruction cache. 
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Preferably, the criteria for interrupting the core processor include the priority of the 
interrupts and the capacity of the processor to allocate bandwidth to interrupt 
servicing. Most preferably, the prioritizing is dynamically responsive to changing 
allocation criteria, e.g. a current-usage model. In accordance with one embodiment, 
an interrupt processor determines whether the detected interrupt service request is of a 
priority meeting one or more defmed high-priority criteria and if so then signals the 
core processor to perform the inserting. Alternatively, the interrupt processor 
determines whether a natural core processor context switch is imminent and if so then 
signals the interrupt processor to make ready the highest priority interrupt service 
request and signals the instruction queue mechanism to fetch the readied interrupt 
service request in advance of the context switch. 

The apparatus takes the form of a digital processor for use in a computer 
supporting one or more hardware interrupt inputs. The processor includes an 
instruction cache and a fetch-and-decode unit, the fetch-and-decode unit fetching 
instructions from the instruction cache and decoding the instructions into micro- 
opcodes. The processor also includes a dispatch-and-execute unit having one or more 
execution ports, the dispatch-and-execute unit scheduling and executing the micro- 
opcodes in the one or more execution ports and thereafter retiring the micro-opcodes 
back into the instruction cache. Finally, the processor includes an interrupt-handling 
mechanism responsive to one or more hardware interrupt inputs, the interrupt- 
handling mechanism instructing the fetch-and-decode unit to insert into a normal 
instruction sequence decoded micro-opcodes representing interrupt servicing 
instructions for scheduling and execution by the dispatch-and-execute unit. 

The foregoing and other objects, features and advantages of the invention will 
become more readily apparent from the following detailed description of a preferred 
embodiment which proceeds with reference to the drawings. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Fig. 1 is a block diagram of apparatus configured in accordance with the 
invention by which core processor bandwidth is allocated between routine processing 
and interrupt servicing. 
5 Fig. 2 is a schematic diagram of the interrupt concentrator that forms a part of 

the apparatus of Fig. 1 . 

Figs. 3A and 3B are state diagrams of the invented signaling mechanisms 
among the interrupt concentrator, instruction fetch unit and p-code decoder that form a 
part of the core processor architecture of Fig. 1, the state diagrams corresponding with 
10 two different embodiments of the interrupt concentrator of Fig. 2. 

Fig. 4 is a flow chart of the core processor bandwidth-allocation method in 
accordance with the invention. 



DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

15 Fig. 1 is a simplified system block diagram of the apparatus in accordance with 

the invention. The apparatus may be seen to include plural hardware interrupt signals 
12a, 12b, ... 12n for coupling to one or more hardware input/output (I/O) devices, not 
shown. It will be appreciated that the I/O devices generate hardware interrupt signals 
conventionally, the interrupt signals identifying the I/O device that generated the 

20 interrupt. Typical UO devices include keyboards, mouses, joysticks, video cards, 

MIDIs, CD-ROMs, disc drives, zip drives, SCSIs, modems, etc. Modems are used as 
telecommunication vehicles for facsimile, e-mail, chat rooms, and world-wide web 
(network) browsing with its multimedia content, and the demand for high- 
performance telecommunications in personal computers, workstations and Internet 

25 service platforms grows daily. 

Interrupt signals 12 are input to an interrupt concentrator 14, in accordance 
with the invention. Interrupt concentrator 14 detects one or more interrupts 
represented by interrupt signals 12 and ranks them in accordance with a determined 
priority scheme. Higher priority interrupts are handled before lower priority 

30 interrupts, as is typical. In accordance with the invention, interrupt concentrator 14 
determines when a ranked hardware interrupt should be serviced, i.e. when a 
predefined demand threshold is met. Via a special port indicated in Fig. 1 by signal 
paths extending therefi-om, interrupt concentrator 14 signals a core processor 16. Core 
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processor includes a p-code decoder 18, and an instruction fetch unit 20. P-code 
decoder 18 is coupled with instruction fetch unit 20. Interrupt concentrator 14, core 
processor 16, p-code decoder 18 and instruction fetch unit 20 will be described in 
more detail by reference below to Figs. 2, 3A and 3B. 
5 Referring still to Fig. 1, core processor 16 may be seen also to include an 

instruction cache 22 coupled with the input of instruction fetch unit 20. Core 
processor 16 may be seen fiirther to include an out-of-order (OOO) execution unit 24 
coupled with the output of p-code decoder 18. Finally, core processor 16 may be seen 
to include a p-code retirement unit 26 coupled between OOO execution unit 24 and 

10 instruction cache 22. Each of these additional functional blocks forming a part of core 
processor 16 will be described in turn. 

Instruction fetch unit 20 fetches instructions from memory and stores them in 
consecutive order within instruction cache 22. Instruction fetch unit 20 thus stages 
instructions for decoding by p-code decoder 1 8 to which its output instruction stream 

15 is routed. Instruction fetch unit 20 and instruction cache 22 will be referred to herein 
collectively as an instruction queue mechanism, indicated generally at 28. 
Instruction fetch unit 20 reads a predefined number of bytes, e.g. thirty-two bytes, 
from instruction cache 22 and routes some lesser number of bytes, e.g. sixteen bytes, 
to p-code decoder 18. 

20 Instruction fetch unit 20 then hypothesizes what the next needed instructions 

will be by analyzing any branch instructions therein as to whether or not they are 
likely to be taken. Such analysis may include different prediction criteria for branches 
not before seen and those seen before. Branch predictions not seen before may be 
deferred to p-code decoder 18, which may use a static branch prediction algorithm to 

25 be described below. For branches that have been seen before, instruction fetch unit 20 
may analyze historic information stored in a branch target buffer (not shown) within 
instruction fetch unit 20. Such instruction pre-fetching as is performed in accordance 
with the invention by instruction fetch unit 20 may also be referred to herein as 
instruction streaming. 

30 P-code decoder 1 8 decodes the instructions received from instruction fetch 

unit 20 into a set of typically multiple, reduced instructions called micro-ops (|iOPs). 
Preferably, p-code decoder 18 includes three parallel decoders and a micro-code 
instruction sequencer, not shown. Micro-ops are routed to a register alias table 

Patent application 5 do. no. 5038-33 1 



(RAT), also not shown. In the RAT, any register references and status information 
implicit within the |iOPs are incorporated before the |xOPs are stored in an instruction 
pool/reorder buffer (ROB), also not shown. The three parallel decoders have different 
capabilities for optimized decoding of instructions into ^lOPs. One of them is able to 
5 decode an instruction into 1-4 jaOPs, while the other two can decode an instruction 
only into one ^lOP. Complex instructions, i.e. macro opcodes or Macro-Ops, in 
accordance with a complex instruction set (CISC) processor architecture are decoded 
by the microcode instruction sequencer into reduced instructions, e.g. ^lOPs, in 
accordance with a reduced instruction set (RISC) processor architecture. 

10 Out-of-order execution unit 24 schedules and executes the jiOPs stored in the 

ROB that is loaded by p-code decoder 18, A reservation station (RS) within out-of- 
order execution unit 24 continuously scans the ROB for jxOPs that are ready to be 
executed and dispatches them to one or more of five execution ports. The five 
execution ports preferably include a first port including an integer unit, a floating 

15 point unit, an address generation unit, a packed arithmetic logic unit (ALU) and a 
packed multiply unit. A second port includes an integer unit, a packed ALU and a 
packed shift unit. The third through fifth ports include a load unit, a store address 
calculation unit and a store data unit. 

Instruction retirement unit 26 simply stores the executed instructions from 

20 OOO execution unit 24, in their original consecutive order, into instruction cache 22 
for future re-use by instruction fetch unit 20. Those of skill in the art will appreciate 
that instruction retirement ensures that instruction cache 22 thus will contain 
instructions that may be required by instruction fetch unit 20 when instruction fetch 
unit 20 performs its next instruction pre-fetch. This avoids the latencies related to 

25 memory bus contention and read access that otherwise would slow processing if the 
same instruction sequence is required to be performed again, as may often be the case. 

Interrupt service can be very demanding due to its high volume and 
irregularity. Thus, in accordance with the invention, it is desired to involve high- 
speed core processor 16 in the routine servicing of interrupts. This is accomplished in 

30 accordance with the invention by inserting p-code into the above-described core 
processing of instructions. Thus the invented method and apparatus minimize the 
overhead associated with a costly context switch that effectively halts core processor 
1 6; saves the program instruction counter; saves register contents and context flags; 
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flushes instruction cache 22; executes the interrupt service routine; restores the 
register contents and context flags; restores the program instruction counter; re-loads 
instruction cache 22 from memory; and resumes execution of the interrupted program. 
Those of skill in the art will appreciate that the invention is useful and greatly 

5 beneficial even if there is no retirement of executed ^lOPs to instruction cache 22. Of 
course, there will be a latency penalty when the interrupt vector, e.g. an interrupt 
service routine (ISR), is required to be executed again and it cannot be found in 
instruction cache 22. Nevertheless, there is still the great advantage of having core 
processor 16 responsive to interrupt service p-code inserted into the normal p-code 

10 execution stream, which minimizes disruption from context switching. Modem core 
processors such as core processor 16 provide such instruction recycling logic as is 
represented by instruction retirement unit 26 coupled with instruction cache 22. In 
this way, it is possible in accordance with the invention to realize an additional 
advantage: the ISR may be already staged for re-execution upon a like interrupt signal 

15 from interrupt concentrator 14. 

Fig. 2 is a schematic block diagram of interrupt concentrator 14, which will be 
described in more detail below. In this embodiment, interrupt concentrator 14 
includes interrupt control logic for signaling p-code decoder 18 and instruction fetch 
unit 20 in anticipation of an impending interrupt event. Pending event will be 

20 understood to refer to a scheduled interrupt as determined by interrupt concentrator 14 
to be ready to process within a period of time that is predetermined to be relatively 
short. Such signaling by interrupt concentrator 14 may be accomplished by simply 
asserting a pending interrupt request signal to instruction fetch unit 20 to move the 
requested interrupt vector into instruction cache 22 and by simply informing p-code 

25 decoder 1 8 to expect a code-ready signal from instruction fetch unit 20. 

Instruction fetch unit 20 responds to such a signal from interrupt concentrator 
14 by determining whether the requested interrupt vector already is in instruction 
cache 22, in which case instruction fetch unit 20 simply sends a code-ready signal to 
p-code decoder 18. Alternatively, instruction fetch unit 20 responds to such a signal 

30 from interrupt concentrator 14 by fetching, e.g. performing a memory read cycle 
followed by a cache write cycle of, the requested interrupt vector into instruction 
cache 22 and then sending a code-ready signal to p-code decoder 18. P-code decoder 
18 responds to the signal from interrupt concentrator 14 by arming an embedded state 
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machine for the ISR p-code arrival from instruction fetch unit 20. Meanwhile, p-code 
decoder 18 runs out the normal instructions already within instruction fetch unit 20. 
In other words, p-code decoder 18 continues to operate normally by reading CISC 
instructions from instruction cache 22, decoding the instructions into RISC 

5 instructions and feeding the RISC instructions to OOO execution unit 24. 

It is possible that instruction cache 22— which has been starved since 
instruction fetch unit 20 was signaled by interrupt concentrator 14 — ^will run out of 
instructions before the ISR code is loaded therein. In this case, p-code decoder 18 
halts momentarily (e.g. a few master clock cycles) until the arrival of the code-ready 

10 signal. While this represents a worst-case scenario, it may be avoided or its adverse 
impact minimized by fine-tuning the timing by which interrupt concentrator 14 signals 
instruction fetch unit 20 and p-code decoder 18 in anticipation of an interrupt event. 

It is also possible that while p-code decoder unit 18 awaits receipt of the code- 
ready signal from instruction fetch unit 20, interrupt concentrator 14 may trigger the 

15 interrupt, i.e. the pending interrupt that was earlier signaled by interrupt concentrator 
may ripen into a present interrupt event. In this case, a context switch occurs in which 
the ISR is executed as soon as one or more instructions reach instruction cache 22 and 
data or other needed operands are available. While this represents less-than-optimal 
performance in accordance with the invention, its adverse impact may be reduced also 

20 by fine-tuning the timing by which interrupt concentrator 14 signals instruction fetch 
unit 20 and p-code decoder 1 8 in anticipation of an interrupt event. 

Most probably, upon receipt of the code-ready signal from instruction fetch 
unit 20 indicating that one or more ISR instructions have been loaded into instruction 
cache 22, p-code decoder 18 begins decoding the interrupt service routine code as it is 

25 read from instruction cache 22, This represents a best-case scenario in which the 
greatest potential advantage of the invention in increasing the efficiency and 
responsiveness of core processor 16 to hardware interrupts is realized. It is also the 
most likely case, when instruction fetch latency is well-defined and relatively constant 
and when the timing by which interrupt concentrator 14 anticipates an interrupt 

30 service requirement is properly tuned. Those of skill in the art will appreciate that 

such timing may be variable and adaptive to dynamically changing rates of instruction 
fetch unit supply and OOO execution unit and interrupt concentrator demand. 
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Those of skill in the art will appreciate that the interrupt vector typically is an 
interrupt service routine (ISR) represented, for example, by a starting address in 
memory of the ISR. Under interrupt chaining conventions, plural interrupts at the 
same level typically share an interrupt line while representing plural distinct interrupt 

5 service requirements. In these chained-interrupt cases, the interrupt vector may be 
two or more ISRs represented, for example, by a starting address in memory of the 
first of plural chained ISRs. Chained ISRs by their own chaining protocol typically 
execute in succession as though they were a single, longer routine. In other words, 
their chaining involves the continued execution of any and all routines linked within 

10 the ISR chain. 

Accordingly, those of skill in the art will appreciate that when interrupt 
concentrator 14 signals p-code decoder 18 and/or instruction fetch unit 20 with an 
interrupt vector, the interrupt vector may represent one or more ISRs to be executed 
back to back. Those of skill in the art also will appreciate that the number of ISRs 

15 represented by an interrupt vector is transparent to the invention in accordance with 
the embodiments described and illustrated herein. In other words, the invention need 
not know whether or not ISRs are chained and does not care. 

In an alternative embodiment of interrupt concentrator 14, p-code decoder 18 
includes interrupt insertion logic for signaling interrupt concentrator 14 of an 

20 impending natural context switch. A natural context switch represents an opportune 
time for insertion of interrupt servicing p-code into the normal instruction process. 
One example of an impending natural context switch is anticipation of a macro-code 
boundary, i.e. a new instruction, scheduled for imminent decode and execution. 
Another example of an impending natural context switch is an immediate 

25 unconditional direct branch instruction, e.g. jmp label, or an instruction that pushes an 
address to an address stack, e.g. a pushae or pushad. In either case, a context switch is 
imminent and inevitable, which presents a near-term opportunity for efficient interrupt 
service processing. 

In accordance with the interrupt insertion embodiment of the invention, 

30 interrupt concentrator 14, in response to the impending natural context switch signal 
from p-code decoder 1 8, readies the highest-priority interrupt vector from a pending 
"hot" list representing concentrated hardware interrupt requests. Then interrupt 
concentrator 14 directly signals instruction fetch unit 20 to fetch the highest priority 

Patent application 9 ix>. no. 5038-33 1 



requested interrupt vector (typically an ISR) into instruction cache 22. Meanwhile, p- 
code decoder 18 no longer requests normal code from cache 22 via instruction fetch 
unit 20. (Out-of-order execution unit 24 eventually stalls in this scenario, as intended, 
so that interrupt servicing begins at the opportune threshold of the natural context 
switch that triggered the interrupt servicing,) When instruction cache 22 is loaded 
with the highest priority interrupt vector (code), in response to the direct signaling 
from interrupt concentrator 14, instruction fetch unit 20 signals p-code decoder 18 of 
the fact. 

As soon as one or more ISR instructions is in process within p-code decoder 
18, p-code decoder 18 signals interrupt concentrator 14 to de-assert the interrupt 
request signal, or interrupt event, line from the interrupt concentrator's interrupt 
control logic. (Those of skill in the art will appreciate that the actual interrupt line 
from the hardware that produced the interrupt is de-asserted ultimately by the 
hardware itself in response to some form of acknowledgement from the ISR servicing 
the interrupt, e.g. the setting or clearing of an interface status bit.) 

P-code decoder 18 then tracks how much time or how many events there are to 
service until normal service may resume. P-code decoder 18 signals interrupt 
concentrator 14 if more interrupts may be serviced during the present interrupt 
insertion cycle of core processor 16. Again, interrupt concentrator 14 readies the 
highest-priority pending interrupt vector and signals op-code decoder 18 to signal 
instruction fetch unit 20 to fetch the interrupt vector (code) corresponding therewith 
into instruction cache 22. The signaling processes between interrupt concentrator 14 
and p-code decoder 18, and those between instruction fetch unit 20 and p-code 
decoder 18, are repeated for so long as a priority interrupt is pending and there is 
available opportunity for interrupt processing within core processor 16. Thereafter, 
interrupt concentrator 14 signals p-code decoder 18 that the pending priority interrupt 
queue, or hot list, is empty. Such interrupt insertion will be understood effectively to 
achieve so-called hardware interrupt service 'pipelining' that is unprecedented in core 
processing architectures. 

The timing of the interrupt insertion cycle, i.e. the duration of the open 
window of opportunity for efficient interrupt servicing, may be programmable. 
Typically a so-called 'time-slice' is defined during which p-code decoder 18 and 
interrupt concentrator 14 may continue interrupt processing of any hot list vectors. 
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Those of skill will appreciate that all hot interrupts may be serviced within the 
established time-slice, in which case p-code decoder unit 18 signals instruction fetch 
unit 20 to resume normal code caching. When the established time-slice nears 
expiration, i.e. the interrupt service window is closing, then p-code decoder 18 signals 
5 interrupt concentrator 14 that the currently executing ISR is the last ISR that may be 
processed during the present time-slice. After the currently executing ISR is run to 
completion, p-code decoder 18 signals instruction fetch unit 20 to fetch the line code 
that was running when the natural context switch occurred. 

Ideally, of course, the processing of the last hot interrupt event will be just 

10 completed within the interrupt handling time-slice so that normal processing of 
instructions begins just as interrupt processing ends. It is possible within the spirit 
and scope of the invention to approach this ideal scenario by adaptive techniques. For 
example, the duration of the time-slice may be a programmed variable based at leeist 
in part upon which applications are running on the computer. Other contextual 

1 5 information may be used to make the time-slice dynamically responsive to real-time 
interrupt servicing, perhaps in accordance with current-usage models and/or predictive 
behavior models. 

Those of skill in the art will appreciate that interrupt concentrator 14 may be 
implemented, within the spirit and scope of the invention, in a combination of 

20 hardware and firmware, with programmable variables being implemented preferably 
in software. Such programmable variables preferably include interrupt priority and 
timing criteria as well as core processor bandwidth utilization criteria. They may be 
programmed by a resident operating system (OS) and/or by OOO execution unit 24, 
by way of initialization and feedback, respectively. For example, the criteria may 

25 initially be established by the OS, which may assume a general usage model for all 
applications running thereon, including those that may be interrupt-intensive. 
Thereafter, the criteria may be changed in dynamic response to a particular usage 
model as determined by OOO execution unit 24 or auxiliary part of core processor 16 
and based upon actual utilization of core processor bandwidth. Most preferably, the 

30 criteria for interrupt processing relative to routine processing within core processor 16 
is dynamically responsive to a user. Thus, within the spirit and scope of the invention, 
current-usage models may dynamically govern processor bandwidth allocation within 
core processor 16 among routine and hardware interrupt tasks. 
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Those of skill in the art will appreciate that interrupt concentrator 14 may be 
programmed to give higher priority to disc drive and modem interrupts than to video 
card interrupts, thereby enhancing the performance of server applications. Interrupt 
concentrator 14 may be programmed to give higher priority to keyboard and mouse or 
5 joystick interrupts than to modem interrupts, thereby enhancing the performance of 
v^^ord processing or computer-aided design (CAD) applications. Interrupt concentrator 
14 may be programmed to give higher priority to video card and joystick interrupts 
than to disc drive interrupts, thereby enhancing the performance of video games. The 
invention thus involves the core processor intimately in the application-textured real 

10 world of hardware interrupts. 

Priority preferably thus is not fixed, but instead is dynamically responsive to 
changing computer applications and environments. In accordance with the invention, 
interrupt concentrator 14 is programmable so that the priority criteria may be adjusted 
in real-time response to changing computer application demands. One or more 

15 current-usage models may influence the priority criteria upon which interrupt 

concentrator 14 bases its ranking and timing of interrupt event handling, in real-time 
response to changing behaviors of computer applications or their users. 

Interrupt concentrator 14 will be understood by those of skill in the art 
preferably to provide interrupt queuing, priority contention and determination and 

20 multiplexing functions. The functions are performed in accordance with the invention 
in such manner as to permit the highest priority one or more interrupts to be 
acknowledged to the I/O device requesting service and queued for handling by p-code 
decoder 18 and optionally instruction fetch unit 20, as described above. Such queuing 
preferably involves identifying the hardware interrupt request level or port and time- 

25 stamping the request for priority purposes. One or more buffers may be included in 
interrupt concentrator 14, or such may form a part of port 16 to be described 
immediately below, such buffers being configured as interrupt queues that maintain 
the relative time and port information described above. The multiplexing function 
may be understood to be straightforward selection of interrupt requests into one or 

30 more of such buffers as it may be determined that the request is of sufficiently high 
priority. 

In accordance with the interrupt control method of operation of interrupt 
concentrator 14, interrupt concentrator 14 may be programmed to schedule an 
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interrupt event in accordance with one or more criteria. While two such criteria and 
their combination are described immediately below, those of skill in the art will 
appreciate that, within the spirit and scope of the invention, any suitable criterion or 
combination of criteria may be used. 
5 Interrupt concentrator 14 may signal p-code decoder 18 on a lapsed-time basis, 

e.g. every ten milliseconds (assuming there is an interrupt event to be scheduled). 
Those of skill in the art will appreciate that the period of such time-based signaling 
may range more broadly from fewer than hundreds of microseconds, e.g. SOOixs, to 
more than tens of milliseconds, e.g. 50ms, or any other suitable periods. 
10 Alternatively, interrupt concentrator 14 may signal p-code decoder 18 on an 

event-count-threshold basis, e.g. every ten interrupts. Those of skill in the art will 
appreciate that the threshold of such event count-based signaling may range more 
broadly from one to dozens or even hundreds of hardware interrupt events, or any 
other suitable counts. 

1 5 The two above and/or other criteria may be combined, within the spirit and 

scope of the invention. For example, interrupt concentrator 14 may signal p-code 
decoder 18 every 10ms or ten interrupts, whichever occurs first. Alternatively, 
interrupt concentrator 14 may signal p-code decoder 18 every 10ms and ten interrupts, 
whereby both a defined time lapse and a defined event count threshold must be met 

20 prior to signaling. 

Those skilled in the art will appreciate that interrupt concentrator 14 preferably 
is responsive to allocation criteria including not only the interrupt demand placed 
thereon by the hardware I/O devices but also the bandwidth of core processor 16. 
Thus, in accordance with the invention, the signaling criteria including the time 

25 between interrupts and/or the number of pending interrupts are programmable, e.g. by 
the resident operating system. By rendering the criteria programmable, the core 
processor's total bandwidth may be fairly allocated between normal and interrupt or 
exception handling. In other words, the resident operating system can program 
interrupt concentrator 14 to interrupt every 5ms or every 15ms, or every five interrupts 

30 or every fifteen interrupts, depending upon overall core processor bandwidth 

allocation algorithms that may be application-specific. Such variables within interrupt 
concentrator 14 most preferably are initialized to default values when the computer is 
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booted and periodically or continuously tailored or adjusted to instantaneous hardware 
interrupt demands and to instantaneous core processor bandwidth. 

Another aspect of the invention will be referred to herein as look-ahead 
interrupt servicing, or interrupt queuing within interrupt concentrator 14. In 
5 accordance with this aspect of the invention, an event pending message is sent out, i.e. 
interrupt concentrator 14 signals p-code decoder 18, some time before the actual event 
is triggered. This accommodates any unavoidable latency involved in signaling and 
readying instructions within cache 22 for decoding by p-code decoder 18. For 
example, if the event scheduling criteria is every ten interrupts, then the signaling 

10 might occur after the seventh interrupt, thereby accommodating an average latency of 
three interrupts. In this manner, interrupt concentrator 14 effectively signals p-code 
decoder 18 to prepare for interrupt servicing. 

The port P shown in Fig. 1 will be understood by those of skill in the art 
preferably to be a hardware port that includes the signal paths shown therein. It also 

15 may include one or more buffers, a control bus including one or more signal lines, and 
a data bus including one or more data signal lines. The control bus preferably provides 
for the signaling described above between interrupt concentrator 14 and p-code 
decoder 18. The control bus optionally may provide for the signaling described above 
between interrupt concentrator 14 and instruction fetch unit 20 and OOO execution 

20 unit 24. Such signaling as defines port P may be in accordance with any suitable 
protocol, within the spirit and scope of the invention. The buses and any included 
buffers are preferably high-speed and most preferably are configured for parallel data 
and/or control transfers among concentrator 14, p-code decoder 18, instruction fetch 
unit 20 and OOO execution unit 24. 

25 Figs. 3A and 3B are state diagrams illustrating the invented signaling 

mechanisms among interrupt concentrator 14, p-code decoder 18 and instruction fetch 
unit 20. Fig. 3 A illustrates an embodiment of the invention (corresponding with Fig. 
2) in which interrupt concentrator 14 controls the insertion of interrupt servicing into 
the normal operation of core processor 16 based upon demand, i.e. the occurrence of 

30 an impending interrupt event for which interrupt service is required. Fig. 3B 

illustrates an alternative embodiment of the invention (also corresponding with Fig. 2) 
in which p-code decoder 1 8 controls the insertion of interrupt servicing into the 
normal operation of core processor 16 based upon supply, i.e. the availability of an 
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impending time slice, e.g. a natural context switch, during which core processor 16 
readily may accommodate interrupt service. 

Referring to Fig. 3 A, interrupt concentrator 14 includes a state machine, 
preferably coded in software or firmware or a combination thereof. The state machine 
5 first signals p-code decoder 1 8 and instruction fetch unit 20 that there is an interrupt 
event pending, i.e. that an interrupt meeting the given priority criteria is imminent. In 
response to the signaling fi-om interrupt concentrator 14, a state machine forming a 
part of p-code decoder 18 transitions to a state of waiting. This first signaling, or 
signal mechanism, is denoted (1). 

10 In response to the signaling fi-om interrupt concentrator 14, a state machine 

forming a part of instruction fetch unit 20 determines whether the instruction stream 
representing the requested interrupt service routine already is in instruction cache 22 
(not shown in Fig. 3A, for clarity). 

If the interrupt service routine instructions are found already to reside within 

15 instruction cache 22, then the state machine of instruction fetch unit 20 immediately 
signals p-code decoder 18 with a code-ready signal. Such readiness typically would 
occur when the same interrupt service routine is needed routinely, and already has 
been fetched once, executed and retired to the instruction cache 22 as is customary. 
This second signaling, or signal mechanism, is denoted (2). 

20 If it is determined that the interrupt service routine instructions are not already 

within instruction cache 22, then the state machine of instruction fetch unit 20 
immediately fetches such instructions not already resident in instruction cache 22. It 
does so by reading them fi-om memory at the designated vector address, whether pre- 
assigned or designated by interrupt concentrator 14. The state machine of instruction 

25 fetch unit 20 enters a wait state until the interrupt service routine instructions, or at 
least some of them, are retrieved fi*om memory and placed in instruction cache 22. 
When the instructions are resident, the state machine of instruction fetch unit 20 
immediately signals p-code decoder 18 with a code-ready signal. This third signaling, 
or signal mechanism, is denoted (3). 

30 Upon such code-ready signaling by instruction fetch unit 20, p-code decoder 

1 8 transitions fi-om its wait state to an acknowledgement state in which it signals 
interrupt concentrator 14 that the interrupt service request has been honored. This 
fourth signaling, or signal mechanism, is denoted (4). 
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In response to such acknowledgement, interrupt concentrator 14 de-asserts its 
event pending signal from both p-code decoder 18 and instruction fetch unit 20. The 
state machine within interrupt concentrator 14 then returns to its initial idle state and 
awaits another priority interrupt. Those of skill in the art will appreciate that the 
5 interrupt service routine effectively removes the hardware interrupt that gave rise to 
the interrupt servicing~e.g. by setting or clearing an interrupt request-pending status 
bit in a hardware interface associated with the hardware device— upon execution of an 
instruction within the ISR. 

Referring to Fig. 3B, p-code decoder 18 includes a state machine, preferably 

10 coded in software or firmware or a combination thereof. The state machine first 

signals interrupt concentrator 14 that an instruction or instruction sequence dispatched 
to and currently in process within OOO execution unit 24 will result in an impending 
context switch. Such a state machine preferably includes a context switch prediction 
unit that predicts naturally occurring context switches in anticipation thereof. Such 

15 predictions are predicated on the routine, i.e. interrupt-independent, instruction stream 
that most recently was decoded by p-code decoder 18 for dispatch and execution by 
OOO execution unit 24. Such prediction unit may be realized in accordance with the 
invention in the form of hardware, firmware or software within core processor 16. 
This first signaling by, or signal mechanism within, the state machine of p-code 

20 decoder 1 8 is shown in Fig. 3B as a get-ready signal denoted (1). 

A state machine within interrupt concentrator 14 responds to the get-ready 
signal by readying the hot list of hardware interrupts requiring service. Those of skill 
will appreciate that such interrupts are prospective, and subject at least to reordering 
in terms of priority, before one or more of them is signified, as by the signaling of 

25 instruction fetch unit 20. When the hot list is established, the state machine within 

interrupt concentrator 14 signals a state machine within instruction fetch unit 20 to get 
one or more vectors, e.g. ISRs, ready for insertion into the normal instruction stream. 
This second signaling, or signal mechanism, is shown in Fig. 3B as a get-vector signal 
denoted (2). 

30 A state machine forming a part of instruction fetch unit 20 responds to the get- 

vector signal from the state machine of interrupt concentrator 14 as above. In other 
words, the state machine determines whether the vector corresponding with the first 
interrupt vector on the 'hot' list already resides in instruction cache 22 and if not then 
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reads memory to load instruction cache 22. When the code is resident within the 
cache the state machine within instruction fetch unit 20 signals p-code decoder 1 8 
with a code-ready signal. This third signaling, or signal mechanism, is denoted (3). 

In response to the code-ready signal, the state machine within p-code decoder 
18 determines how much time is available for interrupt servicing prior to a resumption 
of normal service. In other words, in accordance with the invention, the state 
machine within p-code decoder 18 estimates based upon its 'knowledge' of the nature 
and probable duration of the detected natural context switch and other core processor 
priorities and allocates a time slot to the instant interrupt servicing. The state machine 
within p-code decoder 18 then signals interrupt concentrator 14 if more interrupts can 
be handled during the present insertion cycle. The cycle thus may be repeated, as 
described above. This fourth signaling, or signal mechanism, is denoted (4). 

In response to the more-interrupts query, the state machine within interrupt 
concentrator 14 signals the state machine within p-code decoder 18 that its interrupt 
queue is empty, e.g. the repeated cycle is complete. Alternatively, the state machine 
within interrupt concentrator 14 signals the state machine within p-code decoder 18 
that it has an immediate service request. In the latter event, which represents a hybrid 
implementation of the two state machine embodiments, it will understood that the 
state machines within interrupt concentrator, p-code decoder 1 8 and/or instruction 
fetch unit 20 implement features of both architectures. All such state machine 
protocols are contemplated, and are within the spirit and scope of the invention. This 
fifth signaling, or signal mechanism, is denoted (5). 

Turning briefly back to Figs. 1 and 2, interrupt concentrator 14 may be 
understood now to include priority logic, or what may be thought of as a 'smart' 
multiplexer, 30 for ranking and selecting one or more hardware interrupts 12 as a 
pending event. Priority logic 30 may be micro-programmed by the resident operating 
system or may be responsive to signaling from p-code decoder 18 to optimally 
allocate core processor bandwidth under what may be a dynamically changing 
application environment. Such allocation logic 32 may form a part of a so-called 
fetch-and-decode unit 34 comprising instruction fetch unit 14 and p-code decoder 18. 
Alternatively, such allocation logic may form a part of a so-called dispatch-and- 
execute unit 36 comprising OOO execution unit 24 and p-code retirement unit 26. 
Finally, with respect to Fig. 1, a context switch prediction unit 38 is coupled with 
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fetch-and-decode unit 34 for predicting a naturally occurring context switch, as 
described above with reference to Fig. 3 A. 

Those of skill in the art will appreciate that priority logic 30 and allocation 
logic 32 may take alternative forms, within the spirit and scope of the invention, and 

5 that within the spirit and scope of the invention they may reside anywhere within 
apparatus 10 or a computer system of which apparatus 10 may be part. Thus, it may 
be seen that a block within interrupt concentrator 14 is designated identically as 
allocation logic 32, which allocation logic may be thought of as a distributed function 
of apparatus 10 made in accordance with the invention. 

10 Fig. 2 also shows a block labeled interrupt control logic 40, which will be 

understood to control the operation of priority logic 30 and of a state machine 42 
within interrupt concentrator 14. Finally, Fig. 2 shows the above-described hot list 
buffer 44 that in accordance with the invention buffers those priority interrupts 
meeting the priority and timing criteria for signaling a pending event to core processor 

15 10, as described. Those of skill in the art will appreciate that interrupt concentrator 14 
may within the spirit and scope of the invention take alternative forms, and that it is 
preferably implemented in a combination of high-speed hardware and firmware to 
perform the needed interrupt concentration and signaling functions. 

Fig. 4 is a flow chart illustrating the bandwidth-allocation method in 

20 accordance with the invention. The bandwidth-allocation method in one aspect of the 
invention starts at 100 with the detection of one or more hardware interrupts on one or 
more interrupt lines. At 102, the interrupt is queued, as in a buffer not shown, and its 
priority level determined. At 104, interrupt concentrator 14 determines whether the 
interrupt meets the current bandwidth-allocation criteria for either interrupting core 

25 processor 16 via event signal mechanism (1) to p-code decoder 18 and to instruction 
fetch unit 20 (refer Fig. 3 A). It will be understood that the current bandwidth- 
allocation criteria preferably are maintained by interrupt concentrator 14 as 
programmed variables that are initialized and updated by an external allocation agent 
such as OOO execution unit 24 or the resident OS. 

30 In this manner, interrupt servicing is responsive not only to the priority of the 

interrupts but also to the capacity of the processor to allocate bandwidth to interrupt 
servicing. If the criteria are not met, then control returns to block 100 where a further 
interrupt is awaited. If on the other hand it is determined at 104 that the criteria are 
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met, then interrupt concentrator 14 at 106 performs the signaling described in detail 
above. This signaling is illustrated in Figs. 2 and 3 A, denoted therein as signal 
mechanisms (2)-(4), and in Figs. 2 and 3B, denoted therein as signal mechanisms (2)- 
(5). These steps may be repeated, of course, as suggested by Fig. 4. 
5 Such illustrated processor bandwidth allocation will be understood by those of 

skill in the art to be realized in accordance with the invention by establishing the 
criteria and communicating the same to interrupt concentrator 14 and thereafter by 
updating them as necessary to reflect current application-specific sensibilities. In this 
way, changing interrupt service demands are met with more or less resistance based 
10 not upon fixed criteria or underlying assumptions about the relative priority of normal 
and exception handling, but rather upon notions of greater responsiveness and 
productivity. 

Turning now very briefly to Fig. 5, it may be seen that a computer system as 
illustrated may incorporate the invention, whether in the form appeiratus 10 or the 

15 invented methods described and illustrated herein. Those of skill in the art will 

appreciate that the computer system as illustrated may include such I/O devices as a 
keyboard, mouse, monitor or external disc drive, operatively connected to a personal 
computer or workstation or server in the form of a tower or other hardware platform. 
Typically, those of skill in the art will appreciate that apparatus 10 would take the 

20 form of one or more processor integrated circuits (ICs) mounted on processor boards 
having memory devices and I/O interfaces including interrupt hardware. It is by virtue 
of the invention described and illustrated herein that such computer systems 
incorporating the invention will be more responsive to their users' changing needs in 
diverse new applications that are increasingly I/O intensive. 

25 Having illustrated and described the principles of our invention in a preferred 

embodiment thereof, it should be readily apparent to those skilled in the art that the 
invention can be modified in arrangement and detail without departing from such 
principles. We claim all modifications coming within the spirit and scope of the 
accompanying claims. 
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